Abstract. Growing networks have a causal structure. We show that the causality strongly influences
the scaling and geometrical properties of the network. In particular the average distance between 
nodes is smaller for causal networks than for corresponding homogeneous networks. We explain 
the origin of this effect and illustrate it using as an example a solvable model of random trees. We 
also discuss the issue of stability of the scale-free node degree distribution. We show that a surplus 
of links may lead to the emergence of a singular node with the degree proportional to the total 
number of links. This effect is closely related to the backgammon condensation known from the 
balls-in-boxes model.

INTRODUCTION

Many real-world complex networks have heavy tails in the node degree distribution 
[1, 2, 3]. It was a great discovery [4] to realize that networks with such a property 
can be naturally produced by a preferential attachment growth [5]. It has become an 
important issue to work out the consequences of the presence of heavy tails in the degree 
distribution on topological properties of networks [1, 2, 3]. 

The degree distribution, however, gives only crude information about the topology of
random networks unless it is supplemented by some other information. Let us take as an
example $k$-regular graphs. They are defined as graphs with all nodes having degree equal
to $k$. The node degree distribution of $k$-regular graphs is $p(q) = \delta_{qk}$ independently of the
graph topology. A regular equilateral triangulation and a cubic lattice are perfect examples
of 6-regular graphs, but they have completely different topological and geometrical
properties: the triangulation has fractal dimension two, while the cubic lattice has three.
One can see the difference when one determines, for example, the number of nodes at a
given distance from a given node.

The two examples of 6-regular graphs are not random graphs, so one could argue
that if one considered random graphs from the ensemble of 6-regular graphs
one would see statistically identical topological and geometrical properties. But
to answer this question one has to define what a random graph is. One can do it by
introducing a statistical ensemble of graphs [6, 7, 8, 9, 10, 11, 12, 13, 14]. This means
that one has to specify the set of graphs which one wants to study, for instance
$k$-regular graphs, trees, simple graphs, pseudographs, etc., and their attributes: directed,
undirected, Eulerian, connected, etc. Then one has to define a probability measure on this
set by ascribing to each graph from the set a positive number. This number gives (after
normalization) the probability that the graph will be selected when the set is sampled
randomly. The statistical properties of the ensemble depend heavily on the choice of the
probability measure.

The classical example of this approach is the ensemble of Erdős-Rényi graphs [15,
16, 17]. The set of graphs consists of all graphs with $N$ nodes and $L$ links having neither
multiple- nor self-connections. The measure in the Erdős-Rényi ensemble is defined as
follows [14]. One labels all nodes by the integers $1, \ldots, N$ to obtain labeled graphs. Such
graphs are isomorphic to $N \times N$ symmetric adjacency matrices with $L$ entries equal to
one in the upper triangle (and symmetrically also in the lower triangle). Each labeled
graph (adjacency matrix) has the same statistical weight. The partition function reads:

$$Z_{hNL} = \sum_{lg} 1 = \sum_{g} n(g). \qquad (1)$$

The index $h$ stands for homogeneous and will be explained later. The first sum runs over
all labeled graphs $lg$, the second over graphs $g$ (= unlabeled graphs). By a graph we mean
the graph's topology, that is the shape or skeleton which one obtains when one removes
the labels. The number $n(g)$ of distinct labelings (adjacency matrices) of the graph $g$
depends on $g$, and thus some graphs are more and some less probable in this
ensemble.
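The dependence of the number of labelings on the topology can be checked by brute force for very small graphs. The following is a minimal sketch, not part of the original analysis; the function name `count_labelings` and the three example graphs are chosen here purely for illustration. It relabels a representative edge list with every permutation and counts the distinct labeled graphs obtained.

```python
from itertools import permutations


def count_labelings(n_nodes, edges):
    """Count distinct labeled graphs (adjacency matrices) with one topology.

    n_nodes: number of vertices, labeled 0..n_nodes-1 in the input.
    edges:   list of (a, b) pairs describing one representative labeling.
    """
    seen = set()
    for perm in permutations(range(n_nodes)):
        # Relabel every edge; a frozenset of frozensets ignores edge order
        # and edge orientation, like a symmetric adjacency matrix does.
        relabeled = frozenset(frozenset((perm[a], perm[b])) for a, b in edges)
        seen.add(relabeled)
    return len(seen)


# Hypothetical example topologies (illustration only).
path3 = [(0, 1), (1, 2)]
triangle = [(0, 1), (1, 2), (0, 2)]
star4 = [(0, 1), (0, 2), (0, 3)]

for name, n, edges in [("path3", 3, path3), ("triangle", 3, triangle), ("star4", 4, star4)]:
    print(name, count_labelings(n, edges))
```

A three-node path has three distinct labelings while a triangle has only one, so in an ensemble of equiprobable labeled graphs the path topology would be three times more probable than the triangle topology.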

One should realize that the definition (1) assumes that a) we can distinguish the
vertices of the graph, and b) we are only interested in properties that do not depend on
the labelings. A permutation of indices changes neither the graph's topology nor the physical
quantities measured on the graph. The model thus has permutation symmetry and one
should divide out the volume of the permutation group. One could explicitly write the
factor $1/N!$ instead of 1 in the sum (1). However, as long as $N$ is fixed this factor
is constant and therefore can be pulled out in front of the sum and skipped as an
irrelevant normalization of the partition function. One should keep it in any formula
for an ensemble with varying $N$, as we will do in the next section (see for example (4)).
The origin of this factor is the same as in classical statistical mechanics for identical
particles. These assumptions are reasonable for many real-world examples of graphs,
but one must be aware that they can be violated. For example, when considering chemical
compounds we must treat vertices of the same kind as indistinguishable, as
in quantum statistics. Then we should use an ensemble where each graph topology has
the same weight: $Z = \sum_g 1$. In practice such a definition turns out to be much more
difficult to handle [18]. Ensembles in which the probability of selecting a given
graph depends on its labels also appear naturally when one considers growing
networks [19, 20].

In the Erdős-Rényi ensemble (1) all labeled graphs are equiprobable. One can however
weight the graphs in a way which depends on their topology: for example one can
introduce correlations between the degrees of neighboring nodes [8, 21], one can favor loops
[10], which are very rare in Erdős-Rényi graphs, or one can introduce a weight
which modifies the node-degree distribution [6, 22]. In the last example one does it by
modifying the partition function to the following form:

$$Z_h = \sum_{lg} w(q_1)\, w(q_2) \cdots w(q_N), \qquad (2)$$

where $q_k$ is the degree of the $k$-th node and $w(q)$ is an arbitrary weight function. One can tune
the weight function $w(q)$ in order to obtain graphs with a desired degree distribution.
In this way one can produce for example scale-free graphs with the Barabási-Albert
degree distribution [6, 14].

In this contribution we apply the concepts of statistical mechanics, such as statistical
ensembles, partition functions, ensemble averages, and so forth, to study random
graphs. Within this approach many interesting results have already been obtained [6, 7, 8,
9, 10, 11, 12, 13, 14]. This approach is a straightforward generalization of the
Erdős-Rényi ideas.

At first glance one might think that statistical mechanics is not adequate for
growing networks, which are not in equilibrium. Indeed such ensembles cannot be
understood as equilibrium ensembles with the Gibbs measure, a temperature, etc.
One should rather understand them as follows. Imagine that one repeats the process of
growth many times independently and each time one terminates it when the network
reaches a given size. One obtains a collection of networks which occur with certain
probabilities. This perfectly defines an ensemble of graphs [7, 19]. It is not an equilibrium
ensemble, but all methods of statistical mechanics work, so one can use them. In order
to distinguish growing networks, which are inhomogeneous, from the homogeneous graphs
discussed above (for which all nodes are treated equally, as for instance in Erdős-Rényi
graphs), we shall call the growing networks causal, and the networks obtained from
arbitrarily labeled graphs homogeneous. The difference will become clear in the next
section. Anticipating some results, causal and homogeneous networks have different
geometrical features even if they have identical node-degree distributions.

CAUSAL VERSUS HOMOGENEOUS NETWORKS 

Vertices of a growing network can be labeled by integers representing the order of
attachment to the network. It is clear that not all possible labelings of the underlying
graph can be realized in this way (see figure 1). We will call those that can be realized causal
labelings, or equivalently we will say that the labels are causally ordered. A necessary
condition for a causal labeling is that a) the root has the smallest label and b) one can
connect every vertex to the root¹ by a path (not necessarily the shortest one) in such a way
that the labels on it increase from the root. This implies that every vertex
except the root must have at least one neighbor with a smaller label.
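The condition on increasing paths can be turned into a short check. The sketch below assumes a connected graph given as a dictionary of neighbor sets keyed by integer labels; the function name `is_causal_labeling` and the two example chains are illustrative assumptions, not taken from the original text.

```python
def is_causal_labeling(adj, root):
    """Check whether integer labels on a connected graph are causally ordered.

    adj:  dict mapping each label to the set of labels of its neighbours.
    root: label of the root vertex.
    """
    if root != min(adj):
        return False
    reached = {root}  # vertices joined to the root by an increasing path
    for v in sorted(adj):
        if v == root:
            continue
        # v is reachable by an increasing path iff some smaller-labeled
        # neighbour is reachable; smaller labels were processed earlier.
        if any(u < v and u in reached for u in adj[v]):
            reached.add(v)
        else:
            return False
    return True


chain_ok = {1: {2}, 2: {1, 3}, 3: {2}}    # path 1 - 2 - 3
chain_bad = {1: {3}, 3: {1, 2}, 2: {3}}   # path 1 - 3 - 2
print(is_causal_labeling(chain_ok, 1), is_causal_labeling(chain_bad, 1))
```

In the second chain the vertex labeled 2 has only the neighbor 3, so it could not have been attached before vertex 3 existed, and the labeling is rejected.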

As in the previous section we can define a partition function for the ensemble of
equiprobable rooted graphs with causal ordering:

$$Z_{cNL} = \sum_{clg} 1 = \sum_{g} n_c(g). \qquad (3)$$

The first sum runs over all causally labeled graphs $clg$. This ensemble is a counterpart
of the Erdős-Rényi ensemble (1) for homogeneous graphs in the sense that all labeled
graphs are equiprobable. The difference between the two is that only causal labelings are
allowed here. The second sum in (3) runs over all graphs $g$. Each graph is weighted in
the sum by the number $n_c(g)$ of its causal labelings. This number is smaller than the number
$n(g)$ of all labelings of this graph. Although the partition functions (1) and (3) cover
the same set of graph topologies $\{g\}$, their statistical weights are different. When one
samples graphs randomly from the homogeneous ensemble (1), one thus observes
different probabilities of occurrence of graph topologies than in the ensemble of causal
graphs.

¹ Obviously we are considering only connected graphs. For disconnected graphs this condition must be
suitably modified.

FIGURE 1. Examples of labelings of a simple graph. Two labelings marked with the dashed rectangle
are causal. The labeling in the middle is not. Also the remaining nine labelings of this graph (not shown
in the picture) are not causal because the root does not have the label one.

This can be illustrated by calculating various quantities for random tree graphs 
(branched polymers) for which the calculations can be done analytically [19, 20, 21, 
23, 24, 25, 26, 27, 28]. 

Let us give a short account of the results of those calculations. We shall discuss ensembles
of planted rooted trees. Each graph in such an ensemble is connected and has no loops.
One node of the graph has a single line sticking out of it, interpreted as the stem of the
tree (we omit the root node at the end of the stem). The number of branches (links) $L$ of
a tree with $N$ nodes is equal to $N - 1$. The stem is not counted as a branch. Because $L$
is determined by $N$, we can skip $L$ in $Z_{hNL}$ and denote the canonical partition function for
trees by $Z_{hN}$. The grand-canonical partition function for homogeneous planted trees is

$$Z_h(\mu) = \sum_{N=1}^{\infty} \frac{1}{N!}\, Z_{hN}\, e^{-\mu N} \qquad (4)$$

and can be represented as a bubble in figure 2. The free end of the bubble denotes
the stem, while the bubble denotes the sum over all trees weighted as in (4). One can write
an identical definition for causal trees. For technical reasons, instead of the chemical
potential $\mu$ in the grand-canonical partition function (4) we prefer to use the fugacity

$$x = e^{-\mu}.$$

An advantage of using the grand-canonical partition function is that one can deduce a
closed formula [19] expressing $Z_h(x)$ as a function of itself:

$$Z_h(x) = x \sum_{k=0}^{\infty} \frac{1}{k!}\, Z_h^k(x) = x\, e^{Z_h(x)}, \qquad (5)$$

which can be graphically represented as in figure 2. The meaning of this graphical
equation is the following. Any rooted tree can be constructed by joining the stems of a
certain number of trees at a common node and attaching a new common stem to this
node. If one sums over all trees in each bubble on the right-hand side of the graphical
equation shown in figure 2, one obtains a sum over all trees also on the left-hand side.
In order to obtain the partition function $Z_h(x)$ (4) one has to take care of the power of $x$,
which counts the number of vertices. The new tree has one vertex more
than all the subtrees used in the construction together, so one has to multiply the right-hand side by $x$.
In order to avoid overcounting while joining $k$ trees, which now form $k$ branches of the
new tree, one has to divide out the factor $k!$, which counts the number of indistinguishable
ways in which one can put the trees together. Thus the expression $Z_h^k$, which represents
the composition of $k$ subtrees, is divided by $k!$. Finally, one has to sum over $k$ to include
all branching possibilities at the root. In this way one reproduces the partition function
$Z_h$ on the left-hand side and $x \exp(Z_h)$ on the right-hand side of Eq. (5).

FIGURE 2. Graphical representation of the self-consistency equation (5). The bubble contains the
grand-canonical sum over planted rooted trees. Each tree is weighted with the size factor and the small
circle representing a single node gives an additional factor $x$.

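Using equation (5) one can determine $Z_{hN}$ by an inverse Laplace transform (here
written as a contour integral):

$$Z_{hN} = \frac{N!}{2\pi i} \oint \frac{dx}{x^{N+1}}\, Z_h(x). \qquad (6)$$

Changing the integration variable from $x$ to $Z = Z_h(x)$ and using (5), that is $x = Z e^{-Z}$, one obtains

$$Z_{hN} = \frac{N!}{2\pi i} \oint \frac{dZ}{Z^{N}}\, (1 - Z)\, e^{NZ} = N^{N-1}. \qquad (7)$$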
This result tells us that there are $N^{N-1}$ labeled planted rooted trees of size $N$. There is
one planted rooted tree for $N = 1$, two for $N = 2$, nine for $N = 3$, etc. The labeled trees
for $N = 1, 2, 3$ are shown in figure 3. Note that if one removes the stem one obtains an
ensemble with one marked vertex: the tree is not planted anymore. The number of trees
with one marked vertex is thus also equal to $N^{N-1}$. From this one can deduce the number
of all labeled trees without any marked vertex. On a tree with $N$ vertices one can mark
one of $N$ vertices, so the number of trees with a marked vertex must be $N$ times larger
than the total number of labeled trees. Thus the number of labeled trees of size $N$ is
$N^{N-2}$. This is the classical result derived first by Cayley (see reference [29] for a general
introduction to counting graphs).
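The count of labeled planted rooted trees can be cross-checked by solving the self-consistency relation, with $Z_h$ on the left and $x \exp(Z_h)$ on the right, as a power series. The sketch below assumes the expansion $Z_h(x) = \sum_N Z_{hN} x^N / N!$, which is the convention implied by the contour integral (6); the helper names `series_exp` and `solve_tree_series` are illustrative only. Exact rational arithmetic is used so that the coefficients can be compared without rounding.

```python
from fractions import Fraction
from math import factorial


def series_exp(z, order):
    """Exponential of a power series z (with z[0] == 0), truncated at x**order.

    Uses E' = Z' E, i.e. n * e_n = sum_{k=1..n} k * z_k * e_{n-k}.
    """
    e = [Fraction(1)] + [Fraction(0)] * order
    for n in range(1, order + 1):
        e[n] = sum(k * z[k] * e[n - k] for k in range(1, n + 1)) / n
    return e


def solve_tree_series(order):
    """Coefficients of Z(x) solving Z = x * exp(Z), by fixed-point iteration."""
    z = [Fraction(0)] * (order + 1)
    for _ in range(order):              # each pass fixes one more coefficient
        e = series_exp(z, order)
        z = [Fraction(0)] + e[:order]   # multiplication by x shifts the series
    return z


order = 6
z = solve_tree_series(order)
counts = [int(factorial(n) * z[n]) for n in range(1, order + 1)]
print(counts)
```

Multiplying the $N$-th series coefficient by $N!$ should reproduce the number of labeled planted rooted trees of size $N$.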

FIGURE 3. Planted rooted trees of size $N = 1, 2, 3$. Causal labelings are marked with dashed rectangles.

Let us come back to planted rooted trees, but now consider only causally labeled ones.
On a tree there is exactly one path joining any given vertex and the root, and along
this path the labels must increase from the root outwards (see figure 3). One can derive [19] a
self-consistency equation for causal planted rooted trees similarly to (5). It has the same
logical structure as the graphical equation shown in figure 2, but causality imposes a
requirement on the ordering of labels. A careful analysis of the consequences of this
requirement shows that one obtains the following self-consistency equation for causal
trees [19]:

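$$\frac{dZ_c(x)}{dx} = e^{Z_c(x)}. \qquad (8)$$

This equation can be solved for $Z_c$, yielding:

$$Z_c(x) = -\ln(1 - x) = \sum_{N=1}^{\infty} \frac{x^N}{N} = \sum_{N=1}^{\infty} \frac{(N-1)!}{N!}\, x^N, \qquad (9)$$

and hence:

$$Z_{cN} = (N - 1)!. \qquad (10)$$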

Thus there are $(N - 1)!$ causally labeled planted rooted trees. For example, there is one
for $N = 1$, one for $N = 2$, two for $N = 3$, etc. One can easily find these trees among all
labeled trees in figure 3. Causally labeled trees form a small subset of all labeled trees:
by Stirling's formula the fraction of causally labeled trees, $Z_{cN}/Z_{hN} = (N-1)!/N^{N-1} \sim N^{1/2} e^{-N}$,
quickly vanishes as the size of the system grows. One may ask whether the statistical properties of this subset are
identical to those of the whole set. The answer is that they are completely different. Probably
the most striking difference is that the causal trees are much more compact and have
a much smaller linear extent than homogeneous trees.
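For very small sizes both counts can be verified by exhaustive enumeration. The sketch below represents a rooted labeled tree by a root label and a parent map for the remaining vertices; this encoding and the function names are illustrative assumptions, and the brute-force search is only practical for $N$ up to about five or six.

```python
from itertools import product
from math import factorial


def _reaches_root(v, parent, root, n):
    """Follow parent pointers from v; True if the root is hit within n steps."""
    for _ in range(n):
        if v == root:
            return True
        v = parent[v]
    return False


def rooted_labeled_trees(n):
    """Yield (root, parent_map) for every rooted tree on the labels 1..n."""
    labels = range(1, n + 1)
    for root in labels:
        others = [v for v in labels if v != root]
        for choice in product(labels, repeat=n - 1):
            parent = dict(zip(others, choice))
            if all(_reaches_root(v, parent, root, n) for v in others):
                yield root, parent


def is_causal(root, parent):
    """Labels must increase along every path leading away from the root."""
    return root == 1 and all(parent[v] < v for v in parent)


def count_trees(n):
    total = causal = 0
    for root, parent in rooted_labeled_trees(n):
        total += 1
        causal += is_causal(root, parent)
    return total, causal


for n in range(1, 6):
    print(n, count_trees(n), (n ** (n - 1), factorial(n - 1)))
```

Each printed line shows the enumerated pair (all trees, causal trees) next to the pair $(N^{N-1}, (N-1)!)$ for comparison.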

One can quantify this statement by checking how the average distance between nodes
depends on the size of the tree. One defines the geodesic distance $r_{ab}$ for each pair of vertices
$a, b$ as the number of links on the (shortest) path between $a$ and $b$. To get the average
distance one averages $r_{ab}$ over all pairs of nodes on the tree and over all trees in the
ensemble. The result is that for homogeneous trees the average distance behaves for
large $N$ as [21, 24, 27]:

$$\langle r \rangle_h \sim \sqrt{N}, \qquad (11)$$

while for causal ones² [19, 30]:

$$\langle r \rangle_c \sim \ln N. \qquad (12)$$

The power-law growth of the linear extent $\langle r \rangle_h \sim N^{1/2}$ with the system size means that
homogeneous trees have fractal dimension $d_h = 2$, while the behavior $\langle r \rangle_c \sim \ln N$ means that
the fractal dimension of causal trees is $d_c = \infty$. The separation of vertices on a causal tree
is very small and the whole tree structure is compactly concentrated around the oldest
part of the tree.

This can be well illustrated by studying the two-point correlation function $G_N^{(2)}(r)$:

$$G_N^{(2)}(r) = \left\langle \frac{1}{N^2} \sum_{a,b} \delta(r - r_{ab}) \right\rangle. \qquad (13)$$

The delta function $\delta(r - r_{ab})$ selects only pairs $a, b$ separated by $r$ edges. The two-point
function can thus be interpreted as the probability that two randomly chosen nodes
of the tree are separated by $r$ links. Alternatively it can be interpreted as the distance
distribution giving the fraction of nodes at a distance $r$ from a randomly chosen one. By
construction, $\sum_r G_N^{(2)}(r) = 1$ and $G_N^{(2)}(0) = 1/N$. The average distance between vertices
is given by the mean value of the two-point function:

$$\langle r \rangle = \sum_{r=1}^{\infty} r\, G_N^{(2)}(r). \qquad (14)$$
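For a single graph the two-point function can be measured with a breadth-first search from every vertex; the ensemble average would then be taken over many sampled graphs. The following sketch assumes a connected graph stored as an adjacency dictionary and normalizes by $N^2$ over ordered pairs including $a = b$, which is the normalization consistent with the sum rule and with the value $1/N$ at $r = 0$ stated in the text. The function names are illustrative.

```python
from collections import Counter, deque
from fractions import Fraction


def two_point_function(adj):
    """Distance distribution of one connected graph.

    adj: dict mapping each node to a list of its neighbours.
    Returns {r: G(r)} where G(r) is the fraction of ordered pairs (a, b)
    separated by r links.
    """
    n = len(adj)
    counts = Counter()
    for a in adj:
        dist = {a: 0}
        queue = deque([a])
        while queue:                      # breadth-first search from a
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        counts.update(dist.values())
    return {r: Fraction(c, n * n) for r, c in sorted(counts.items())}


def average_distance(g):
    return sum(r * p for r, p in g.items())


path3 = {0: [1], 1: [0, 2], 2: [1]}       # three nodes in a line
g = two_point_function(path3)
print(g, average_distance(g))
```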

The two-point function gives very valuable information about the distance distribution
in the given ensemble of graphs. In figure 4 we compare the two-point functions $G_h^{(2)}(r)$
for homogeneous and $G_c^{(2)}(r)$ for causal trees of the same size $N = 1000$. One sees
that typical causal trees are indeed much shorter than homogeneous ones. The distance
distribution for causal trees is much more concentrated.

One can analytically derive the shape of the two-point function $G_h^{(2)}(r)$ for homogeneous
trees in the limit $N \to \infty$. In this limit one can approximate the shape by a
function of a continuous variable:

$$G_h^{(2)}(r) = \frac{a r}{N} \exp\left( -\frac{a r^2}{2N} \right), \qquad (15)$$

where $a$ is a positive constant. We have:

$$\int_0^{\infty} dr\, G_h^{(2)}(r) = 1, \qquad (16)$$

and

$$\langle r \rangle_h \approx \int_0^{\infty} dr\, r\, G_h^{(2)}(r) = \sqrt{\frac{\pi N}{2a}}. \qquad (17)$$

In the large $N$ limit the two-point function becomes effectively an $N$-independent function
of the scaling variable $x = r/\sqrt{N/a}$:

$$G_h^{(2)}(r)\, dr = g_h(x)\, dx = x\, e^{-x^2/2}\, dx. \qquad (18)$$

When one draws the two-point functions for different system sizes as a function of the
scaling variable, the corresponding curves collapse to a universal shape independent of
$N$. It will be illustrated below for the scale-free graphs.

² In the references [19, 30] only the distribution of distances from the root was calculated.

FIGURE 4. The distance distribution functions for the homogeneous (circles) and causal trees (diamonds)
for $N = 1000$. Error bars for data points represented by circles are smaller than the symbol size.

The situation is completely different for causal (growing) trees [31]. First of all,
the mean distance $\langle r \rangle_c$ grows like $\ln N$ and not like $\sqrt{N}$. Second, the two-point
functions $G_c^{(2)}(r)$ for different sizes do not collapse after the rescaling $x = r/\ln N$. If the
scaling held, the dispersion of $r$ would grow like $(\ln N)^2$. We shall see below that
it grows only like $\ln N$. Let us first calculate the average number $n(r, N)$ of vertices of a causal
tree at distance $r$ from the root, a quantity which is closely related to the two-point
function. Since the causal tree is constructed recursively by adding new vertices,
we can write the following recursion relation:

n(r,N+l)=n(r,N) + n(r-l,N)/N, (19) 

which tells us that a new vertex added at distance $r$ from the root must be linked to
a vertex at distance $r - 1$. The factor $1/N$ is just the probability of choosing one out
of $N$ vertices on the tree, and $n(r - 1, N)/N$ is the probability that we choose a vertex at
distance $r - 1$ from the root. Dividing $n(r, N)$ by $N$ we obtain the probability that a
randomly chosen node is at distance $r$ from the root:

$$G_{root}(r, N) = n(r, N)/N. \qquad (20)$$



Multiplying Eq. (19) by $r^m$ and summing over $r$ we can calculate the moments $\langle r^m \rangle_{root}$ of
this probability distribution. In particular, the first two moments read:

$$\langle r \rangle_{root} = -1 + H(N), \qquad (21)$$

$$\langle r^2 \rangle_{root} = \sum_{n=2}^{N} \frac{2H(n-1) - 1}{n}, \qquad (22)$$

where $H(n) = \sum_{i=1}^{n} 1/i$ is the harmonic number. Because $H(N) \approx \ln N$ for large $N$, the
mean distance $\langle r \rangle_{root}$ behaves as $\ln N$. But the dispersion
$\sigma^2_{root}(r) = \langle r^2 \rangle_{root} - \langle r \rangle^2_{root} \approx \ln N$ also grows logarithmically.
So if one introduced a scaling variable $x = r/\ln N$, one
would have in the limit $N \to \infty$: $\langle x \rangle_{root} \to \mathrm{const}$ and $\sigma^2_{root}(x) \sim \mathrm{const}/\ln N \to 0$, so it is
not a good scaling variable. Actually, one can solve the recursion relation (19) using a
generating function formalism. One obtains in the limit $N \to \infty$:

$$G_{root}(r, N) \approx \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left( -\frac{(r - \langle r \rangle)^2}{2\sigma^2} \right), \qquad (23)$$

where, as mentioned before, both $\langle r \rangle$ and $\sigma^2$ grow logarithmically.
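The recursion (19) is easy to iterate exactly. The sketch below starts from a single root at distance zero, applies (19) with rational arithmetic, and compares the resulting mean distance from the root with $H(N) - 1$; it also prints the dispersion, which can be compared with $\ln N$. The function names are illustrative, and the starting condition $n(0, 1) = 1$ is an assumption about how distances are counted (from the root node, not from the end of the stem).

```python
from fractions import Fraction
from math import log


def shell_sizes(n_nodes):
    """Average number n(r, N) of vertices at distance r from the root."""
    n = [Fraction(1)]                     # N = 1: only the root, at r = 0
    for size in range(1, n_nodes):
        grown = n + [Fraction(0)]
        for r in range(1, len(grown)):
            grown[r] += n[r - 1] / size   # Eq. (19)
        n = grown
    return n


def harmonic(n):
    return sum(Fraction(1, i) for i in range(1, n + 1))


def root_moments(n_nodes):
    """First and second moments of G_root(r, N) = n(r, N) / N."""
    n = shell_sizes(n_nodes)
    mean = sum(r * c for r, c in enumerate(n)) / n_nodes
    second = sum(r * r * c for r, c in enumerate(n)) / n_nodes
    return mean, second


N = 30
mean, second = root_moments(N)
print(float(mean), float(harmonic(N) - 1))
print(float(second - mean ** 2), log(N))
```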

The function $G_{root}(r, N)$ is closely related to $G_c^{(2)}(r)$, the two-point function we introduced
before. For large growing trees, $G_{root}(r, N)$ becomes almost identical to $G_c^{(2)}(r)$,
because in almost all cases if one randomly chooses a pair of vertices, the shortest path
between them goes through the root. Only if the two lie on the same branch does the shortest
path not contain the root, but the probability that they are on the same branch vanishes
for $N \to \infty$. Thus for large trees the difference between $G_c^{(2)}(r)$ and $G_{root}(r, N)$ is in
the proportionality factor in the average distance: $\langle r \rangle_c \approx 2 \langle r \rangle_{root} \approx 2 \ln N$. To summarize,
the two-point function behaves in a completely different way for causal and homogeneous
trees.

Other statistical characteristics of causal trees also differ from those
of homogeneous trees. For instance one can determine the node degree distribution
averaged over trees in the ensemble:

$$p(q) = \left\langle \frac{1}{N} \sum_{a} \delta_{q, q_a} \right\rangle. \qquad (24)$$

Here $q_a$ is the degree of node $a$, the sum runs over all nodes of the tree, and the average
is over all trees in the ensemble. If one calculates this distribution for
homogeneous trees one obtains:

$$p_h(q) = \frac{e^{-1}}{(q-1)!}, \qquad (25)$$

while for causal trees:

$$p_c(q) = 2^{-q}. \qquad (26)$$



Again we see the difference between the set of all labeled trees and the subset of causally 
labeled trees. 
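The causal result can be probed numerically by growing a tree in which every new node attaches to a uniformly chosen existing node, which is the growth rule underlying the factor $1/N$ in Eq. (19). The sketch below is a single-run illustration rather than an ensemble average; the function names, the seed and the size are arbitrary choices, and the root is given degree one at the start to account for the stem.

```python
import random
from collections import Counter


def grow_uniform_tree(n, rng):
    """Node degrees of a tree grown by uniform random attachment."""
    degree = [1]                 # the root carries the stem
    for _ in range(n - 1):
        target = rng.randrange(len(degree))
        degree[target] += 1      # link from the chosen old node ...
        degree.append(1)         # ... to the new node
    return degree


def degree_distribution(degree):
    n = len(degree)
    return {q: c / n for q, c in sorted(Counter(degree).items())}


rng = random.Random(12345)
p = degree_distribution(grow_uniform_tree(20000, rng))
for q in range(1, 6):
    print(q, round(p.get(q, 0.0), 4), 2.0 ** (-q))
```

For a tree of this size the measured fractions are expected to lie close to $2^{-q}$, with statistical fluctuations that shrink as the tree grows.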



So far we have discussed ensembles of unweighted graphs: each labeled graph
has had a statistical weight equal to $1/N!$ independently of its shape. One can easily
extend those considerations to ensembles of weighted graphs, in particular to an
ensemble of graphs for which the statistical weight includes a factor depending on node
degrees (2). For instance, one can analytically solve a model of trees (both causal and
homogeneous) whose statistical weight is a product of weights depending on individual
node degrees. For each node one introduces a weight which is a function of the number
of edges emerging from it. By tuning the weights one can obtain trees with a desired
degree distribution. However, the following question arises. Assume that we have two
ensembles of trees with the same node degree distribution. In the first ensemble we have
all labeled trees and in the second only causally labeled ones. We now remove the labels
from the trees, so that all we have is the graph topology. Can we tell whether a given
graph topology comes from the homogeneous or the causal ensemble? One can answer
this question in the affirmative.

Let us sketch the solution. We will generate scale-free tree graphs by tuning the node
weights. Introducing them into the model, one has to modify the right-hand side of the
self-consistency equations. The exponential series which we used before in equations (5) and
(8) to generate equally weighted trees has to be substituted as follows:

$$e^{Z} = \sum_{k=0}^{\infty} \frac{1}{k!}\, Z^k \;\longrightarrow\; \sum_{k=0}^{\infty} \frac{w(k+1)}{k!}\, Z^k. \qquad (27)$$

Here $w(q)$ is the node weight, which depends only on the node's degree $q$. It can also be
interpreted as a branching ratio for a link to split into $q - 1$ links. In the previous case
the weights $w(q) = 1$ were identical for all $q$. In order to produce scale-free networks
with the Barabási-Albert distribution:

$$p_{BA}(q) = \frac{4}{q(q+1)(q+2)}, \qquad (28)$$

one has to choose $w_c(q) = (q - 1)!$ for causal trees and $w_h(q) = (q - 1)!\, p_{BA}(q)$ for
homogeneous trees. In the limit $N \to \infty$ these two ensembles have identical node-degree
distributions $p_c(q) = p_h(q) = p_{BA}(q)$. To illustrate this we show in figure 5 node
degree distributions obtained by Monte-Carlo simulations of causal and homogeneous
trees with the appropriately chosen branching weights. One can see that they perfectly
follow the distribution (28). In other words, one cannot distinguish to which ensemble
a tree belongs just by measuring node degrees. One can however distinguish the
ensembles very easily if one determines the distance distribution $G_N^{(2)}(r)$. As before,
the causal trees have an infinite fractal dimension, while the homogeneous ones have
fractal dimension equal to two. The distance distribution $G_N^{(2)}(r)$ for homogeneous trees
is plotted in the scaling variable $x = r/\sqrt{N/\ln N}$ on the left-hand side of figure 6 and for
causal trees in the scaling variable $x = r/\ln N$ on the right-hand side. As expected, the
data for homogeneous trees collapse to a curve independent of $N$. In the case of causal
trees there is no collapse due to a weak dependence of the curve's width $\sim 1/\sqrt{\ln N}$
on the size $N$, similarly to the case of unweighted trees mentioned before. The average
distance scales as $\langle r \rangle_h \sim \sqrt{N/\ln N}$ in the first case and as $\langle r \rangle_c \sim \ln N$ in the second.




FIGURE 5. The degree distribution for causal (diamonds) and homogeneous (circles) scale-free trees 
measured in Monte-Carlo runs for N = 1000. One sees that they are statistically identical. 




FIGURE 6. Left: The distance distributions for homogeneous trees with the BA degree distribution (28)
plotted in the rescaled variable $x = r/\sqrt{N/\ln N}$ for different $N$: $N = 500, 1000, 2000, 4000$. The continuous
line is given by the function $b x \exp(-a x^2/2)$. Right: The distance distributions for causal trees
with the same distribution (28), plotted in the rescaled variable $x = r/\ln N$ for three different sizes
$N = 1000, 16000, 128000$. The inset shows how the average distance $\langle r \rangle$ and the width $\sigma$ of $G_N^{(2)}(r)$ scale
with the system size, for $N = 1000, 2000, \ldots, 128000$. Both the plot and the inset indicate that $\sigma$ grows
slower than $\langle r \rangle$ and thus the curves $g_c(x)$ become narrower as $N$ increases. To observe this effect
one has to go to much larger sizes than for homogeneous trees.



Note that we have introduced a logarithmic correction to the square root scaling (11)
and (18) for the scale-free homogeneous trees. This is due to the fact that in this case the
series (27) develops a logarithmic singularity. This correction does not affect the fractal
dimension, which is still two. The average distance scales as

$$\langle r \rangle_h \sim \sqrt{N/\ln N}. \qquad (29)$$



The distribution (28) belongs to a broader class of scale-free distributions [30]:

$$p_{KR}(q) = \frac{(2+\omega)\,\Gamma(3+2\omega)}{\Gamma(1+\omega)}\, \frac{\Gamma(q+\omega)}{\Gamma(q+3+2\omega)}, \qquad (30)$$

which emerge as the limiting distributions in a growth process with the linear attachment
kernel $A_q = q + \omega$, $\omega > -1$. The BA distribution (28) corresponds to $\omega = 0$. The
mean value of the $p_{KR}(q)$ distribution is equal to two: $\sum_q q\, p_{KR}(q) = 2$ for all $\omega > -1$,
in accordance with the average number of links per node on a tree. For large $q$ the
distributions have a power-law tail, $p_{KR}(q) \sim q^{-\gamma}$, with the exponent $\gamma = 3 + \omega$, which
assumes values in the range $\gamma > 2$. Interestingly enough, for the scale-free causal
trees (30) the average distance scales logarithmically, $\langle r \rangle_c \sim \ln N$, independently of $\gamma$,
whereas the scaling properties of the homogeneous trees depend strongly on $\gamma$. In
particular the average distance scales as $\langle r \rangle_h \sim N^{1/d_f}$, where [25, 26, 27]

$$d_f = \max\left( 2, \frac{\gamma - 1}{\gamma - 2} \right). \qquad (31)$$

We have $d_f = 2$ for $\gamma > 3$. For $2 < \gamma < 3$ the fractal dimension $d_f$ assumes values which
grow continuously from two to infinity as $\gamma$ decreases from three to two. Please note
that logarithmic corrections appear for $\gamma = 3$. Those corrections can be interpreted
as the fact that at this point $\langle r \rangle$ grows slower than $N^{1/2}$ but faster than any power
$N^{1/2 - \epsilon}$ with $\epsilon > 0$.
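The properties quoted for this family of distributions can be checked numerically. The sketch below evaluates the distribution through `math.lgamma` to avoid overflow of the Gamma functions, compares the case of vanishing shift with the BA form $4/(q(q+1)(q+2))$, and grows a single tree with the attachment kernel proportional to the degree. The function names, the truncation of the sums and the simulation size are illustrative assumptions; the list-of-link-ends trick used for preferential attachment is a standard implementation device, not something described in the text.

```python
import random
from collections import Counter
from math import exp, lgamma


def p_kr(q, omega):
    """Scale-free degree distribution for the kernel A_q = q + omega."""
    log_p = (lgamma(3 + 2 * omega) - lgamma(1 + omega)
             + lgamma(q + omega) - lgamma(q + 3 + 2 * omega))
    return (2 + omega) * exp(log_p)


def p_ba(q):
    return 4.0 / (q * (q + 1) * (q + 2))


def grow_preferential_tree(n, rng):
    """Node degrees of a tree grown with the kernel A_q = q (omega = 0)."""
    degree = [1]     # the root carries the stem
    ends = [0]       # node i appears degree[i] times in this list
    for new in range(1, n):
        target = rng.choice(ends)       # probability proportional to degree
        degree[target] += 1
        degree.append(1)
        ends.extend((target, new))
    return degree


q_max = 20000        # truncation of the infinite sums (assumption)
for omega in (0.0, 1.0):
    norm = sum(p_kr(q, omega) for q in range(1, q_max + 1))
    mean = sum(q * p_kr(q, omega) for q in range(1, q_max + 1))
    print(omega, round(norm, 6), round(mean, 4))

degree = grow_preferential_tree(20000, random.Random(2024))
hist = Counter(degree)
for q in range(1, 5):
    print(q, round(hist[q] / len(degree), 4), round(p_ba(q), 4))
```

The truncated mean approaches two slowly when the tail is heavy, so for shifts close to minus one a much larger cutoff would be needed.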

The discussion of this section can be summarized as follows. We have considered
four ensembles of trees: (a) homogeneous uniformly weighted, (b) causal uniformly
weighted, (c) homogeneous with the BA scale-free distribution (28), (d) causal
with the BA scale-free distribution. The average distance between nodes in a scale-free
system is generally smaller than in a random one, so graphs in the ensemble (c) have on
average a smaller diameter than in (a), and those in (d) a smaller one than in (b). This effect is well known. It is
related to the presence of nodes with high degree, which gather many
vertices around themselves at distance one. Another effect, which is less known, is that causality
lowers the distances between nodes on the graph. The effect caused by causality is even
stronger than that of scale-free tails: the graphs in (b) have a diameter much smaller than
those in (a), and similarly those in (d) than those in (c). One observes big changes if
one imposes the causality constraint. The reason why causality plays such an important
role in enhancing the small-world effect is related to the fact that the oldest vertices,
in addition to having the highest degrees, cluster with each other, forming a kernel of the
graph with extremely high connectivity. The remaining nodes tightly surround this compact
kernel, making the whole graph structure densely packed. In figure 7 we compare the distance
distribution $G_N^{(2)}(r)$ for the cases (a), (c) and (d) for the same system size. We see that
homogeneous graphs (a) are more elongated than homogeneous scale-free ones (c), which in turn are more
elongated than causal scale-free ones (d). We have not shown the case (b) in the figure
in order to keep it transparent. The case (b) was compared to (a) in figure 4.

FIGURE 7. The distance distribution $G_N^{(2)}(r)$ for $N = 1000$ for unweighted homogeneous trees (circles),
scale-free homogeneous trees (diamonds) and scale-free causal trees (squares).



CONDENSATION

The statistical ensemble of causal trees discussed in the previous section is, for the weights
$w(q) = \Gamma(q + \omega)/\Gamma(1 + \omega)$ [19], equivalent to an ensemble of trees obtained by a growth
process with the linear attachment kernel $A_q = q + \omega$ [30]. The map between these
models is mathematically exact. For non-linear kernels the two models differ slightly,
but both display the same features³. If one applies a superlinear attachment kernel in the
growth process, $A_q \sim q^{\alpha}$ with $\alpha > 1$, one sees the appearance of a singular node which has
a degree proportional to the size of the tree. If one applies a sublinear kernel, $\alpha < 1$,
the degree distribution is suppressed exponentially for large $q$. The linear kernel is
marginal in the sense that it lies exactly between the phase with the singular node and
the phase with the exponential tail. A very similar situation takes place for homogeneous trees, but the
mechanism of the emergence of the singular node is different. The model can be mapped
onto a balls-in-boxes model [32]. In order to obtain scale-free trees one has to choose
appropriate vertex weights of the model. Any deviation from the fine-tuned values results
in either the appearance of the singular node or the exponential suppression of the node
degree distribution, exactly as for growing networks. Roughly speaking, if we denote the
fine-tuned scale-free distribution by $p_0(q) \sim q^{-\gamma}$ (the second case in the equation below),
we have three possible scenarios:

$$p(q) \sim \begin{cases} p_0(q)\, e^{-\mu q}, \\ p_0(q), \\ p_0(q) + \frac{1}{N}\, \delta(q - pN). \end{cases} \qquad (32)$$

³ See however the reference [20].



In the first case the typical fluctuations of the node degree are of order $1/\mu$, 
independently of $N$. In the third case there is a singular node with an extensive number 
of links, $q \sim pN$. This is equivalent to the backgammon condensation of the 
balls-in-boxes model [32]. The appearance of the singular node makes the system even more 
compact than scale-free graphs. The distance between nodes grows more slowly than 
logarithmically, since many vertices lie in the immediate neighborhood of the singular 
node. In the extreme case of the star topology, a vertex surrounded by $N - 1$ vertices, 
the average distance is $2(N-1)/N$, which is smaller than two and practically independent 
of $N$ for large $N$. 
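
The three regimes of the attachment kernel described above can be explored with a short numerical experiment. The following is only a minimal sketch, not the procedure used for the results in this paper: it grows a tree with the pure power-law kernel $A_q = q^{\alpha}$ and reports the largest degree. The function name `grow_tree`, the two-node initial condition and the random seed are assumptions made for illustration.

```python
import random


def grow_tree(n_nodes, alpha, seed=0):
    """Grow a tree node by node and return the list of node degrees.

    Each new node attaches with one link to an existing node chosen with
    probability proportional to degree ** alpha (assumed kernel A_q = q^alpha).
    """
    rng = random.Random(seed)
    degrees = [1, 1]  # assumed initial condition: two nodes joined by one link
    for _ in range(n_nodes - 2):
        weights = [q ** alpha for q in degrees]
        target = rng.choices(range(len(degrees)), weights=weights)[0]
        degrees[target] += 1   # the chosen node gains a link
        degrees.append(1)      # the new node enters with degree one
    return degrees


if __name__ == "__main__":
    n = 1000
    for alpha in (0.5, 1.0, 1.5):
        degrees = grow_tree(n, alpha)
        # For alpha > 1 the largest degree is expected to be a finite
        # fraction of n; for alpha < 1 it should stay small.
        print(alpha, max(degrees), max(degrees) / n)
```

Recomputing all weights at every step costs $O(N^2)$ operations in total, which is acceptable for trees of a few thousand nodes; larger sizes would call for an incremental sampling structure.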

Is the condensation a feature of the tree ensemble only, or is it observed for ensembles 
of graphs as well? We have studied this question for homogeneous scale-free graphs and 
pseudographs. In our conventions graphs have no multiple or self-connecting edges, while 
pseudographs may have both. For graphs, the absence of multiple and self-connections 
imposes strong constraints on the graph structure, sometimes called structural constraints 
[22, 33]. In particular, for finite systems they strongly hinder the development of a 
power-law tail in the node degree distribution [22, 33]. It turns out that they also 
prohibit the condensation discussed above: so far we have found no evidence for the 
backgammon condensation, or for the emergence of a singular node, on graphs. For 
pseudographs, in contrast, the situation is very much like that for trees and one does 
observe the condensation. 

We simulated a canonical ensemble of homogeneous pseudographs with $L$ links and $N$ 
vertices, with the degree distribution $p_{BA}(q)$ (28). This was achieved by tuning the 
node degree weights, as described above for trees, and by choosing $L = N$ so that the 
ratio $\bar{q} = 2L/N$ equals the mean of the distribution, 
$\langle q \rangle = \sum_q q\, p_{BA}(q) = 2$. The system indeed produced the desired 
distribution. The next step was to check how the system reacts to a change in the number 
of links. We increased $L$ while keeping $N$ constant, so that the ratio 
$\bar{q} = 2L/N = 4$ exceeded $\langle q \rangle = 2$. The system reacted as follows. The 
bulk of the distribution remained equal to the desired power law $p_{BA}(q)$, but the 
system additionally produced a singular node which took up the surplus of links. The 
singular node shows up as a peak in the distribution. The position of the peak moves 
linearly with the system size, and the peak detaches from the main body of the 
distribution. Its height is proportional to $1/N$, since it is the probability of picking 
one particular vertex out of $N$. The situation is depicted in figure 8. This is an example 
of the backgammon condensation [32]. If one adds more links, the surplus goes to the 
singular node. One can see this effect by comparing the left and right panels of figure 8, 
which differ in the ratio $\bar{q} = 2L/N$ and, correspondingly, in the position of the 
peak. If one chooses $\bar{q}$ smaller than $\langle q \rangle$, the system generates an 
exponential tail in the node degree distribution. This is again what one expects from the 
balls-in-boxes model. The analogy works for pseudographs but not for graphs because for 
pseudographs the degrees of individual vertices are almost independent of each other, 
apart from the global constraint $q_1 + q_2 + \ldots + q_N = 2L$, whereas for graphs they 
are correlated in a subtle way by the structural constraints. 
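
The bookkeeping behind this picture can be checked with a few lines of arithmetic. The sketch below assumes that the BA distribution referred to as (28) has the standard closed form $4/[q(q+1)(q+2)]$ for $q \ge 1$, which is not reproduced in this section, and that the singular node absorbs exactly the surplus of the total degree $2L$ over $\langle q \rangle N$, as in the balls-in-boxes picture. The function names are hypothetical.

```python
def p_ba(q):
    """Assumed closed form of the BA degree distribution for q >= 1."""
    return 4.0 / (q * (q + 1) * (q + 2))


def mean_degree(q_max):
    """Partial sum of q * p_ba(q) up to q_max; it tends to 2 as q_max grows."""
    return sum(q * p_ba(q) for q in range(1, q_max + 1))


def condensate_degree(n_nodes, n_links, critical_mean=2.0):
    """Estimated degree of the singular node.

    The bulk can hold a total degree of critical_mean * n_nodes; whatever
    remains of 2 * n_links is assigned to the singular node (zero if none).
    """
    return max(0.0, 2 * n_links - critical_mean * n_nodes)


if __name__ == "__main__":
    print(mean_degree(10**5))  # close to 2
    for n in (200, 400, 800):
        # Link densities 2L/N = 4 and 2L/N = 8, as in figure 8.
        print(n, condensate_degree(n, 2 * n), condensate_degree(n, 4 * n))
```

Under these assumptions the estimated peak position is $(\bar{q} - \langle q \rangle) N$, so it doubles whenever $N$ doubles and shifts to larger degrees when the link density is raised, in line with the behavior described above.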




FIGURE 8. Left: The node degree distribution for scale-free pseudographs with $\bar{q} = 2L/N = 4$, which 
is above the condensation threshold $\langle q \rangle = 2$ of the BA distribution (28). The main part of the distribution 
follows the limiting BA curve (solid line). The peak represents the singular vertex. Its position moves 
linearly with the system size, $N = 200, 400, 800$. Right: The same for a larger link density, $\bar{q} = 2L/N = 8$. 



SUMMARY 

We have applied methods of statistical mechanics to compare ensembles of homogeneous 
and causal (growing) networks. We have shown that causality strengthens the small-world 
effect through the clustering of nodes with high connectivity in the kernel of the graph. 
In particular, for homogeneous random trees the average inter-node distance scales as 
$\sqrt{N}$, while for causal networks it scales as $\ln N$. We have compared two ensembles 
of random trees with the Barabási-Albert distribution and observed that, although they 
have identical degree distributions, the causal trees have a much smaller diameter than 
the corresponding homogeneous trees. 

We have also discussed the stability of scale-free distributions. For growing networks 
such distributions emerge for linear attachment kernels. If the kernel is slightly 
perturbed, the system either exponentially suppresses nodes with high degree or develops 
a singular node with a degree proportional to the total number of links. The first type 
of behavior corresponds to sublinear kernels, the second to superlinear ones. Similar 
instabilities are observed for homogeneous pseudographs, where scale-free distributions 
require a fine-tuning of the weight parameters. As before, a slight perturbation leads 
either to exponential suppression or to the emergence of a singular node on the graph. 
In this case the singular node emerges as a result of a condensation of the backgammon 
type [32]. 